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A SURVEYING APPARATUS AND METHODS 

THEREOF 

BACKGROUND OF THE INVENTION 

FIELD OF INVENTION 

The following disclosure relates generally to correlating statistical records and more 
particularly to correlating compensation records to unique individual profiles. 

BACKGROUND OF THE INVENTION 

Today, many reports are available that allow a user to find, read, purchase, or 
otherwise acquire reports on worker compensation. Most often these reports indicate 
average pay rates by industry, job type, locale, and sometimes they report more 
specific information about a particular industry or job, such as bonuses, stock options, 
average workweek, or immigration status, among other things. To create such 
compensation reports two approaches are typically used. One such approach is for a 
human analyst to research and find a statistically valid number of individuals with like 
characteristics, and devise a suite of compensation reports. This process is tedious, 
labor intensive and often expensive. For a truly detailed report the analyst must be 
relied on to do substantial investigation and synthesize and apply this information to 
the case at hand. Compensation consultants with years of experience and resources 
can generally accurately profile an individual's worth in the market place. However, 
such an analysis is extremely specialized and out of the reach of the typical consumer. 
Simpler and less costly reports are available but they are generally broadly classed 
and offer little utility. 

Simpler and less costly reports are available but they are generally broadly classed 
and offer less utility. The majority- of software-based analysis provides a less 
expensive alternative but yields correspondingly limited information. Compensation 
services using current computer analysis programs generally gather data using some 
form of questionnaire and then feed the appropriate data into a computer database or 
spreadsheet. Or more typically, generalized data, such as from the US Bureau of 
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Labor and Statistics, are used as a base and then extrapolated based on region and 
date, and often combined with third party surveys. Typically, a computer then is 
instructed to run an analysis of the data to provide statistical information such as 
averages, medians, and standard deviations on pre-determined groups of people. 
5 However, the information provided is not unique to an individual, but instead is a 
conglomeration of data that the program feels best represents the individual. Because 
the categorization of the individual is based solely on a limited, predetermined set of 
responses to the questionnaire, it offers little to no opportunity for evaluating unique 
characteristics. For example, an automated compensation service may categorize and 

10 calculate data showing that the average yearly salary of a "Computer Programmer 
Level 3" in Washington state is $64,250. This may or may not be applicable to a 
"Senior Application Software Engineer" with ten years of experience and special 
training in the skill C++, but because the closest answer describing the Senior 
Application Software Engineer's position in the initial survey was a "Computer 

15 Programmer Level 3," the Senior Application Software Engineer has thus been 
categorized ineffectively, which removes any unique abilities he may possess. 



The Senior Application Software Engineer reading the aforementioned report cannot 
be sure how closely the published report figures apply to himself individually. There 

20 are a multitude of factors that affect any one individual's job compensation. The current 
generalized reporting methods for compensation reports cannot and do not incorporate 
factors that provide for an accurate job comparison and compensation analysis for 
individual users. Today's methods require the user to gauge or self-approximate 
themselves to a group of people being reported. Typically, such approximations are 

25 grouped by a specific job title that a human compensation analyst predetermines when 
creating a report or when designing a computer service that eventually generates the 
report. This grouping is generally not an exact match with the user's actual job title and 
responsibilities and often has little applicability to the users individual qualities. For 
example, the compensation analyst might have created a report for an isolated group 

30 called Computer Programmer Level 3. For individuals who possess the same 
characteristics as the data files used to create this group, the reports generated from 
such a compensation analysis are reasonably accurate. However, for individuals 
possessing unique capabilities, experiences, skills, or talents, the reports are 
essentially useless. The data are by definition misapplied because any differences in 
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the compared data are arbitrarily reflected in the compensation report. This introduces 
doubt on the user's part as to how closely he can trust the report's applicability. 

Current compensation analysis techniques do not provide users with affordable, 
accurate, and personalized compensation reports. Job specific variables, critical to the 
accurate assessment of an individual's worth, are not correctly identified or uniformly 
applied. Furthermore, individuals within a particular field are unaware of the value of 
certain, often easily obtainable, qualifications. There is a need, therefore, for a system 
and method to provide online compensation reports using a more flexible survey 
system that produces dynamic profiles based on unique individual attributes and 
automated comparisons, and reports that account for these attributes. 

SUMMARY O F THE INVENTION 

The invention overcomes the limitations of the prior art and provides additional 
benefits. Under one aspect of the invention, profiles are used to produce individualized 
compensation reports. A survey engine is used to produce profiles of individuals that 
identify the individuals' unique characteristics. The survey engine incorporates a 
collaborative filtering engine that determines appropriate questions to ask the user 
during the survey, and also provides suggested possible answers. Additionally, the 
system allows for the use of open-text questions. Open-text questions allow for new 
answers to be input by the user, without the prior need for an administrator to pre- 
define the possible values for the system, as is typical in prior art. The system 
incorporates affinity groups around profile attributes (question answers), providing a 
basis for gauging similarity of profiles for various comparison and aggregation 
purposes described herein. A collaborative filtering engine is incorporated, which is 
both periodically modified by an administrator, and also tuned by users themselves 
based on their actions and responses. New affinity groups (associations of profiles) 
are .ncorporated by the survey engine to suggest new questions and possible answers 
m a survey. Additionally, some affinity groups are generated automatically by the 
system, and finally by users themselves to create hew interesting relationships among 
profiles. 

The collaborative filtering engine enables the capture of profile attributes that are 
targeted compensation variables concerning a profile. The system automatically 
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incorporates new profiles into existing affinity groups, which allows more targeted 
survey questions and possible answers to be determined without requiring constant 
human training or intervention. 

5 Without a collaborative filtering system, the system is unable to administer surveys 
accurately for users who do not fit into pre-defined categories, it is impossible to 
categorize every occupational variation because the system allows for open-text 
answers to questions, and because the system provides for different questions to be 
determined automatically and asked for differing types of user job profile, the 

10 collaborative filtering system, along with affinity groups and other requirements 
described herein, is employed. Because the collaborative filtering system allows for the 
system to make educated guesses within defined constraints, the system can handle 
new categorizations more effectively than a system wholly defined by a human 
administrator. Additionally, in this system, the administrator defines constraints that 

15 prohibit the survey from asking obviously wrong or out of place questions. An example 
of a constraint is requiring that if the user does not answer any of the suggested 
questions, a default question is always asked. 

A new and surprising effect of using a collaborative filtering system to define a survey 
20 is that the system is accommodates a much larger population of data using a more 
targeted survey. Previous implementations of compensation surveys relied on a 
smaller sample size, a broad survey with generalized questions, or a larger base of 
analysts to design and conduct surveys and categorize the data. Thus, the survey was 
constrained purely by the human resources needed to conduct it. The invention 
25 described herein is not constrained as such and requires far fewer human resources to 
conduct such a detailed survey across many different job categories. 

Another key aspect of the invention includes the ability to search through the data that 
have been collected by the survey engine, using detailed search criteria defined by a 
30 search definition document. The document applies a scoring and filtering mechanism 
that returns the most appropriate set of profiles based on the user's goal for the 
analysis. The search document is useful because it allows an administrator to define 
natural relationships between profile attributes, such as those that apply routinely to 
the realm of compensation analysis of various classes of profiles. The document is 
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then interpreted by a software algorithm, and used during the retrieval of relevant 
profiles, which can then be used for tallying and reporting. Multiple search definition 
documents can be created quickly, each one for a different goal. For example, one 
search goal may weight skills and certifications very highly. The results are useful for 
analyzing how to compare people with similar technical skills. Another search definition 
document weights experience and education higher, which is useful in seeing how a 
user compares to those profiles who have similar experience and education levels. A 
third example holds location constant, and compares a user to all other profiles with 
matching attributes in the same location. 



Another aspect of the invention includes the automated ability to summarize and 
present the results of a profile search in a format that is useful to a human for 
compensation comparisons. A chart is defined as a series of values, such as skills, 
paired with a series of measures, such as average salary, median salary, standard 
deviation, etc. A sample chart is called "Average Salary By Skill" and it lists each Skill 
along with an associated average salary. A report is defined as a series of charts 
combined to provide an overall picture and analysis for a user. For instance, consider 
a report that aims to discover how a user compares with regard to skills and 
experience in similar jobs. This report incorporates many charts, as defined previously, 
which are combined into a format that gives a user a good analysis and understanding 
of the results of the user's search goal. 

Prior art approaches are unable to automate the selection of charts within a 
compensation report, such as determining if "Average Salary by Practice Area" is 
applicable to a report presented for a particular user profile (it may only be applicable 
to a Lawyer, for instance). The prior art requires that the administrator know in 
advance, every chart that is relevant for a particular user. This shortcoming restricted 
previous inventions in one of two ways: limited the number of charts available, such 
that all charts were available to all compensation reports, regardless of their relevance, 
or they predefined different compensation reports by industry or job category; which 
would require a large amount of labor. 



The invention described herein determines if a chart is relevant to a user based on the 
user's own profile, and displays it, and also only shows it if enough data exists for it to 
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be valid statistically. There may be thousands of different charts of many different 
types and based on many different attributes and measures, and only a subset of them 
may apply to a particular users' compensation analysis. For instance, a lawyer may 
wish to see a report concerning "Average Bonus by Number of Hours Billed per Year," 
but this report is not applicable to teachers or CEOs. In the prior art, each of these 
compensation reports is compiled by a human analyst, created at expense, or details 
are ignored, leaving only the most common charts in the compensation report, such as 
"Average Salary by Occupation and Location," which are often the least useful to an 
individual trying to compare his compensation against his peers. The invention herein 
allows for a more targeted and relevant compensation report by focusing in an 
automated, scalable fashion, on attributes that are unique to a user, in addition to the 
attributes that are most common to all users. 

One skilled in the art would also recognize that such a system could be used to match 
15 individual profiles to resumes because the profiles described herein are typically a 
subset of data gathered for a resume. For example, a user entering a profile into the 
system described herein could be easily matched to resumes using the same 
mechanisms employed to match against other profiles. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is an overview of a website implementation of the system according to the 
invention; 

Figure 2 is a general user flow through the system according to the invention; 
Figure 3 is a specific user flow for the first time a user accesses the system according 
25 to the invention; 

Figure 4 is a flow for the system to suggest a FieldGroup to a user according to the 
invention; 

Figure 5 is a flow for saving a user's answer to a database according to the invention; 
Figure 6 is an exemplary profile according to the invention; 
30 Figure 7 is an example of the system suggesting a FieldGroup in the survey 
Figure 8 is a continuation of Figure 7 

Figure 9 is an example of the system suggesting popular answers for a FieldGroup, in 
the survey according to the invention; 

Figure 10 is a continuation of Figure 9 according to the invention; 
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Figure 11 is a flow for calculating and summarizing data according to the invention; 
Figure 12 is a flow for storing tokenized data in a database according to the invention; 
Figure 13 is a flow for populating an affinity group according to the invention; 
Figure 14 is a flow for generating a custom report for a user according to the invention; 
5 Figure 15 is an overview of the different aspects of a profile search according to the 
invention; 

Figure 16 is a flow for a profile search according to the invention; 
Figure 17 is an example of an affinity group according to the invention; 
Figure 18 is a flow for the rules engine according to the invention; 
10 Figure 19 is a series of rules used in the rules engine according to the invention; 

Figure 20 is an exemplary apparatus in accordance with the disclosed invention being 
further connected to a network according to the invention; 

Figure 21 is an exemplary flowchart for the purpose of creating a user specific affinity 
group according to the invention; 

Figure 22 is an exemplary flowchart showing the profile search process according to 
the invention; 

Figure 23 is an exemplary XML code for profile matching according to the invention; 
Figure 24 is an exemplary flowchart the matching of a user profile to an affinity group 
according to the invention; and 

Figure 25 is an exemplary flowchart for reporting of results of a matching process. 

DETAILED DESCRIPT OF THE INVENTION 

A method and system for providing targeted online compensation reports that 
15 accounts for unique individual job characteristics by using dynamic profiles is 
described in detail herein. In the following description, numerous specific details are 
provided for survey flow, affinities, levels, suggest FieldGroup, suggest popular 
answers, save answers, profile search, scoring system, report aggregation, and rules 
engine. One skilled in the relevant art, however, will recognize that the invention can 
20 be practiced without one or more of the specific details, or with other symbols, 
methods, etc. In other instances, well-known methods or techniques are not shown; or 
are not described in detail, to avoid obscuring aspects of the invention. 
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Definitions 

Field - A single piece of information, corresponding to a particular question asked by 
the system. Examples, "City," "State," "Bonus Amount," "Skill," and "Job Title." 

5 FieldGroup - A set of related Fields that form a logical grouping of information into a 
single record. Also often referred to as a Question. Examples: "Salary" is a FieldGroup 
consisting of "Salary Amount," "Currency," and "Average Workweek." "Job Location" is 
a FieldGroup consisting of "City," "State," and "Country." 

10 Survey - A set of FieldGroups asked in a particular order. An administrator may fix the 
order, or the system may determine the order using the "Suggest FieldGroup" 
algorithm defined below. 

AnswerValue (also Value, Answer or Attribute) - A value (piece of information) 
15 corresponding to a Field, used as a response. Example: For field "City," 
AnswerValues could be "Seattle," "San Francisco," "Miami," or "Paris." 

Profile - A set of FieldGroups, Fields and AnswerValues that form a logical 
representation of an individual person or group of people. Example: 

20 "lndustry=Finance, Job=Accountant, Employer-Name=Ernst & Young, 
Certification=CPA, Certification=CMA, Specialty=Taxation, Specialty=Cost Accounting, 
Years-ln-Field=12, Salary-Amount=60,000, Salary-Currency=US Dollars, Salary- 
Workweek=40 Hours, Location-City=Baltimore, Location-State=Maryland, Location- 
Country=USA, School-Name=Princton, School-Degree=Masters, School- 

25 DegreeYear=1995, Age=43, etc." 

Affinity Group (or Affinity) - A grouping of profiles that share common profile attributes, 
and further as shown in Fig. 17. Example: "All People who work in Law or Legal 
Professions." 

30 • ■ 

Affinity Definition - A Boolean representation of the common attributes shared by an 
affinity group. Example: For "All People who work in Law or Legal Professions", the 
affinity definition might be: "Job=Lawyer OR Job=Attorney OR Job=District Court 
Judge or lndustry=Legal OR Specialty=Trial Law OR etc." 
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Profile Search - A detailed set of criteria used for matching profiles to other profiles 
and creating a scoring system that ranks the validity of the match. 

5 Profile Search Document - A document that encompasses the criteria defined for the 
profile search. 

Chart - An aggregation of data limited to an affinity group, in a format understandable 
to an end user. Example: Bar Chart for "Average Salary By Specialty." 

10 

Report - A set of charts combined in a specific layout to provide a detailed analysis of 
a profile search and comparison goal. 

Overview 

15 Every individual possesses unique distinguishing characteristics in their employment 
profile. These unique characteristics, even when seemingly minor, can correspond to 
differences in employment compensation. Being able to have a custom report that 
compares relevant characteristics for each individual, and having an understanding of 
what the market is willing to pay for the individual's abilities based on such 

20 comparison, is an important step in finding and effectively negotiating employment 
opportunities, as well as making informed career decisions. The method and 
apparatus presented herein provide comparative compensation reports based on 
characteristics determined through a survey and a scored-attribute-matching search 
and reporting process. 

25 

Reference is now made to Fig. 2 where a general user flow through the system is 
shown. A user begins the investigation of their worth by accessing an Internet Website 
through a user interface, such as a personal computer, personal data assistant, or 
similar device. Once access has been established, the user conveys to the system his 
30 desired compensation comparison objectives/goals, as shown in Fig. 1. Once the 
objectives have been identified a survey engine, also referred to as survey wizard, 
begins identifying the unique characteristics of the user that are most applicable to 
determining a compensation level. The survey engine uses a combination of open and 
closed FieldGroups to create an individual profile. An open FieldGroup is defined as 
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having at least one Field for which the user may enter an answer in free text. A closed 
ReldGroup is defined as having a Fields for which the user may only select from a list 
of predefined choices. After each FieldGroup is answered during the survey process 
the individual profile is associated with one or more affinity groups. The affinity groups 
are d.screte groupings of individual profiles based on Boolean criteria. The Boolean 
criteria, either simple or complex, are based on a combination of values from one or 
more attributes on a profile, such as job, skill, location, etc. A user's profile attributes 
are compared against each affinity definition, and if the profile attribute meets the 
aff.n.ty definition criteria, the user is considered to be a member of that affinity group 
For example, a user may be considered part of the affinity group "People in 
Information Technology/Computer Networking" if they have answered having certain 
skills, such as SkiH=TCP/IP, Skill=Cisco Routers, Skill=Windows NT Networking, or 
Certification=Microsoft Certified Systems Engineer. 

The survey engine, or system, discussed in more detail below, through iteration asks 
F.eldGroups of the user until the engine determines no more FieldGroups should be 
asked. Upon beginning the survey process, the user's objectives/goals are confirmed 
through an initial set of questions, as shown in Fig. 1. The goal establishes broad 
areas that should be investigated by the survey engine. The engine, using this goal 
suggests the next FieldGroup (Question) to be posed to the user, from the set of all 
available FieldGroups in the system shown in Fig. 4. The FieldGroup along with 
possible popular answers obtained from previous questioning of different users is 
presented to the user. The system examines the answer offered by the user and if 
appropriate, saves it in a database, as shown in Fig. 5. This process continues until 
the engine determines that the desired goal has been ascertained and no more 
FieldGroups need to be asked. The engine queries if any more goals need to be 
exammed returning the process to establishing a goal for Questioning. The process 
continues as described herein until profile data for all the applicable goals are created 
and the survey is complete. 

The system uses affinities or affinity groups to categorize and group users in many 
segments of the application. An affinity group is defined as a group of profiles defined 
by a set of profile criteria, called an affinity definition. Affinity definitions are a 
combination of values from fields defined in the system also shown in Fig 6 For 



WO 2004/061740 PCT/US2003/041413 

- 11 - 

example, Job=Human Resource Manager is an affinity definition for the affinity group 
consisting of all the profiles such that users answered Job=Human Resources 
Manager. Criteria can be logically defined to create more complex criteria, using 
standard Boolean operators such as AND and OR. For example, the affinity group 
5 called "San Francisco Java Programmers" might have the following affinity definition: 
"City=San Francisco AND (Job=Software Programmer or Job=Software Developer or 
Job=Web Developer) AND Skill=Java." The corresponding set of profiles that match 
this definition is the affinity group. The affinity definition is stored in a relational format, 
which is easily and quickly retrievable, and searchable. One skilled in the art would 

10 recognize that such a definition could be stored in a wide variety of formats suitable to 
the task at hand, or modified for improved CPU performance. A program quickly 
compares the user's profile to all affinity definitions stored in the system, and 
determines to which affinity groups a user belongs. One skilled in the art would be able 
to recognize that more complicated affinities are possible, using more complex 

15 Boolean logic such as NOT, XOR, etc. Additionally, affinities can be created which 
group numeric data by ranges. For instance, all people in a certain age range could be 
grouped together. Affinities are associations of data, defined by a human, an 
administrator, or users, which allow the system to create a more intelligent output for 
use by many of the sub-systems described herein. Additional detail is provided below 

20 in the form of flowcharts. 

The survey engine is required to ask FieldGroups that are relevant for a particular 
user. Many different job profiles have different attributes that affect one's 
compensation. For example, a CEO may need an answer to the FieldGroup (question) 

25 Company Revenue in his profile, while a Lawyer might need Practice Area or Bar 
Association Memberships. The suggest FieldGroup algorithm is designed to ask 
pertinent questions of the user depending on the user's compensation analysis goal 
and the type of profile the user has, learned through iterative questioning. Open-text 
questions provide a unique challenge in automated systems because an administrator 

30 cannot anticipate them. For instance, the prior art allows for a selection - of ah industry " 
from a drop-down list box containing a set of known values. Because all industries are 
known to the system, logic could be added to recommend a new question based upon 
the answer to the previous question. In such a system, a complex decision tree is 
created in advance, and the system is able to create a custom survey based upon the 
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answers given by the user. However, these systems are subject to the limitation of 
knowing all possible answers, and spending a significant amount of labor to program 
the decision tree to account for all possibilities. It also makes it very difficult to add 
new questions because the added responsibility of mapping each answer to the 
decision tree must be part of the process. The invention described herein is not subject 
to such limitations because it allows the entry by users of any answer. Typically, this 
open-ended question is enabled by a text box with few restrictions on what can be 
entered, versus a drop down box, where a user must select one of the pre-defined 
answers. 



Because the system allows open-ended text answers, a standard expert system is not 
possible because it is impossible to create a decision tree for an infinite number of 
questions that would make up a survey. In such situations in other art, an artificial 
intelligence mechanism is usually employed to deal with the unknown variances in 
user input concerning which questions the users answer, as well as what those 
answers are. However, the invention requires that a high degree of recall is 
maintained, which means a new user coming and using the system with the same set 
of profile data as an existing user, should have the same set of FieldGroups 
recommended in the same order (deterministic). To accommodate these requirements, 
i.e. both recall and variability, the system employs a system that defines relationships 
between FieldGroups, a minimum set of required FieldGroups, and an order that the 
FieldGroups appear during the survey. These system rules are combined with a set of 
weights and a subsystem which allows a qualified guess as to which FieldGroups 
should be suggested next, without requiring a predefined survey for each unique type 
of job profile. 



To pick the next FieldGroup to be asked of the user, the survey engine selects the 
most popular FieldGroup that the user has not yet answered, but that is related to the 
FieldGroups already posed to the user, and which is also contained in the profiles of 
the affinity group(s) to which the user currently belongs, subject to the level constraints 
and FieldGroup relationships described herein. An administrator establishes the 
relationships and levels for FieldGroups, prior to a user completing the survey. As 
explained earlier, the system uses a collaborative filtering architecture for the survey 
engine to allow for a large number of FieldGroups and minimal administrative input. 
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Although a collaborative filtering architecture is used, an alternative architecture such 
as a neural network might also be employed. Administrative tasks related to 
Suggesting FieldGroups are limited to cleaning input data that is used to teach the 
system, defining FieldGroup Relations, defining FieldGroup Levels, and defining 
5 affinity groups. 

The inputs to the survey engine are a user's previous answers, affinity groups, 
relationships between FieldGroups (FieldGroup relations document), FieldGroup levels 
(or FieldGroup priority), and a triplet called "Popular FieldGroups," that consists of an 

10 affinity group, a FieldGroup, and a weighted value. The output for the survey engine is 
a single suggested FieldGroup. The constraints in the suggested FieldGroup system 
consist of FieldGroup levels (or priorities) and defaults. The survey wizard prioritizes 
the related FieldGroups by first selecting FieldGroups that possess the highest level 
(or priority) as defined by a level document for the particular wizard or broad goal. 

15 Each FieldGroup is assigned one and only one level per wizard. Each level is assigned 
as a positive integer value to a FieldGroup, with lower levels constraining FieldGroups 
to appear earlier in the survey, and higher levels constraining the FieldGroup to appear 
later in the survey. This ensures that certain FieldGroups appear at or near the 
beginning of the survey, and other FieldGroups appear at or near the end. 

20 Additionally, the level document also groups FieldGroups together in each level, and 
assigns one FieldGroup from the group as a default FieldGroup, so that if an 
administrator wanted to ensure that at least one FieldGroup out of a group of 
FieldGroups is always asked, it is asked if no other FieldGroup from that level is 
asked. For instance, there may be two separate FieldGroups, "Job" and "Position." 

25 Position may be a specialized type of job, which is appropriate to ask in certain 
industries, such as for a professional baseball player. In that case, job need not be 
asked because that information is captured in the position Fieldgroup. For the most 
part, most profiles only need to answer job, whereas in specialized cases, they may 
need to answer alternate FieldGroups such as position. Therefore job and position are 

30 grouped, and job is set by an administrator as the default FieldGrbup~becau'se the 
system requires that if people do not answer any other job-like question, they must 
answer job. 
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The weighted values on the Popular FieldGroups triplet are stored in a relational 
format. The weights represent each FieldGroup's popularity. This is defined as the 
number of profiles for each affinity group that have answers for that FieldGroup. 
Popularity is calculated by tallying up, for each FieldGroup, the number of users who 
have answered that FieldGroup, for each affinity group, and an association between 
an affinity, a FieldGroup, and a popularity, is stored in a relational table. By this 
mechanism, a feedback loop is created between the user's profile questions and 
answers, and the survey wizard. As user profiles are entered into the system via the 
survey process, new associations of similar profiles are produced and those results 
are integrated into the popularity of each corresponding Fieldgroup, yielding a 
subsequently more and more precise survey for differing types of job profiles. 
Additional flowchart discussion is provided below. 

Constraints are generally applied to the system based upon naturally occurring 
15 relationships between FieldGroups, as they pertain to job profiles in general. For 
example, it is assumed that the existence of certain FieldGroups presupposes other 
FieldGroups. A human administrator with expertise in the domain defines these 
relationships based upon domain knowledge. Because the number of relations 
between FieldGroups is far fewer than the number of unique survey possibilities, it is 
20 efficient for an expert to define these relations, even if there are hundreds of 
FieldGroups. For instance, a FieldGroup such as "Bar Association" (generally used for 
lawyers and profiles requiring a legal degree) is only asked if the user's "Job," or 
perhaps their "Industry" is known, but it is pointless to ask that FieldGroup if only the 
user's "Gender" is known. As such, a FieldGroup such as "Bar Association" is related 
25 to "Job" and "Industry," but not to "Gender." Relations such as these are referred to in 
the system as "FieldGroup Related," and are stored in the system using a relational 
format. In addition to the levels, these relations act as constraints on the determination 
of the output of the suggest FieldGroup. 

30 Given the constraints and weights that are resident in the system, the survey engine- 
selects amongst all the FieldGroups available for the survey, the FieldGroup that is the 
most popular (or highest weighted) FieldGroup among the affinity groups to which the 
user belongs, constrained by the levels and FieldGroup relations described previously. 
Finally, the FieldGroup is forwarded to the to the user for presentation. The user 
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answers it, and the process is repeated for subsequent questions. If, after iteration, a 
FieldGroup no longer can be suggested that meets the criteria discussed herein, the 
system marks the users survey as complete and takes a pre-defined next action, such 
as showing a message to the user and moving on to the reporting aspects of the 
system. 

Reference is now made to Figs. 9-10 that show a non-limiting example of the system 
suggesting popular answers for a FieldGroup in the survey. A mechanism exists in the 
system, which suggests possible answer choices for a particular FieldGroup, based 
upon several factors. These factors are, for example the FieldGroup being asked, an 
answer relations document, the user's profile and associated affinity groups, and a set 
of weights, which store the most popular answers for a particular FieldGroup. Just as 
there is a relational basis between FieldGroups asked during the survey process, one 
aspect of the survey engine establishes a relationship between a particular FieldGroup 
and the suggested answers. A particular FieldGroup is answer related to another 
FieldGroup if the answer to the first FieldGroup causes popular answers for the 
second FieldGroup to be suggested. 

The 'suggest popular answers' algorithm is constrained by the answer relations and 
the answer values are weighted using a table consisting of a list of associations 
between a value, an affinity group, and the number of profiles who have answered that 
particular value. Among the constrained answers, the survey engine selects the X 
most popular (most highly weighted) answers among the list, where X is an integer 
defined by an administrator as a reasonable number of values to display for a user to 
25 select from. 

For example if a survey begins with a FieldGroup for Industry', and the FieldGroup is 
displayed, which asks the user about the Industry they work in, and they respond 
"law," a second FieldGroup for Law Firm, based on the process described earlier for 
suggested questions, is displayed and asks the user with which law firm they are 
associated. Based on the Suggest Popular Answers algorithm, the engine can also 
prov.de a list of law firms that were previously provided by other users to this 
F,eldGroup, based on those who also identified themselves in the law Industry The 
affinity group Industry = "law" is, therefore, answer related to the FieldGroup for Law 
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Firm because the suggested answers for the FieldGroup Law Firm are related to the 
answer provided by the first FieldGroup Industry. Correspondingly, for the FieldGroup 
Law Firm, the system looks up all other FieldGroups that are answer-related to it. For 
example, the type of job (FieldGroup for Job) may be answer related to the affinity 
5 group associated with law firms, as may be the FieldGroup for "total hours billed per 
year," etc. Therefore, because FieldGroup Job may be answer-related to FieldGroup 
Industry and FieldGroup Law Firm, the system suggests to the user to select his type 
of job from a list that includes corporate attorney, litigation attorney, paralegal, and so 
forth. While, if the user had answered "Computer Software" for the FieldGroup 

10 Industry, then the system would have suggested different possible answers for 
FieldGroup Job, such as computer programmer, senior software engineer, IT support 
technician, etc. Alternatively, instead of using the suggested popular answers, the user 
may enter a new value, and input a job title that is completely new to the system. 
Based on the user's responses, the system categorizes the users profile and aligns it 

15 with profiles that possess similar characteristics. 

When the survey wizard displays a question, one aspect is to determine if the 
FieldGroup is an open FieldGroup, supporting open-text answers. An open 
FieldGroup is one that allows for free text entries by a user, as opposed to a closed 

20 FieldGroup, where a user may only select values from a list or use numeric answers. 
If the FieldGroup is in an open format, the user may type a open-text answer, or select 
a suggested popular answer as described previously. When the user has made their 
entry, the system determines if the answer was typed as open-text or selected from 
the list of possible answers. If the answer is selected from a list of suggested answers, 

25 the users choice is saved in a database using the existing Answer ID for that existing 
answer in the system. If the answer was free/open-text, an algorithm is invoked to 
determine if the answer already exists in the database in a similar form to the users 
free-text entry. Using search technology, possible existing alternatives from the 
database are suggested that possess similar characteristics as the free-text answer. 

30 The alternative responses may possess different spelling of key words, grammatical 
variations, or combinations of other synonymous words. By doing this, the survey 
wizard tries to identify the underlying focus of the new answer and ensures that the 
system is gaining the right perspective from the user. If the user rejects the suggested 
alternatives, the system requests that the user retype the answer to confirm the 
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response, and possibly enter more information about the answers, so that the system 
or an administrator may categorize the new answer. Once the user confirms the 
response by typing it a second time, the new answer is saved in the database and it 
becomes one of the possible answers that subsequent users may choose. 

An administrator, or automated program, can also verify that any new answers are 
qualified values, such as not containing swear-words or other anomalies An 
automated program may exist which uses a dictionary, or other software that can verify 
the validity of the value. If instead of a new open-text answer, the user selects one of 
the alternative responses, the selected answer is saved in the database using the 
existing Answer ID for that answer. Occasionally, the FieldGroup presented to the user 
.3 not of an open format, such as a small list of possible responses that do not change 
(are not open), or a numeric value, such as a salary figure, or a date, or Boolean 
values such as Yes/No. In numeric or date situations, there are bounds defined as well 
for answers. If the answer to the FieldGroup fits into the bounds of the FieldGroup 
definition, the user's response is saved in the database. For instance, if the user is 
answering a date field, the date may be required to be within a certain range or if a 
salary, it must be greater than U' If the answer to this closed form of FieldGroup is not 
within the bounds of the FieldGroup definition, an error message is sent to the user 
ask.ng for clarification or re-input of the answer. Ultimately, the users profile is saved 
and cataloged in a database that allows the system to correlate it with other affinity 
groups and other user profiles to provide a comprehensive compensation report. 

Reference is now made to Fig. 16, where a flow for a profile search is shown. A major 
aspect of the invention concerns comparing an individual profile to all other profiles in 
the system, and determining which of these profiles are most similar. The Profile 
Search algorithm returns a list of similar profiles. This similar list of profiles are then 
summarized and aggregated into a readable report that offers a complex analysis of a 
user's compensation and career opportunities. This functionality allows the system to 
return an .n-depth, customized report consisting of analysis of similar profiles in real 
t.me. It represents a more accurate picture of a user's compensation than a broad 
survey could, or one done by human analysts, or based only on job titles or a 
standard expert system, etc. The Profile Search algorithm runs, which returns a list of 
profiles to which the user's profile matches most c.osely to the search goal. Profiles 
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are retrieved from the database as a set of FieldGroup,Field,AnswerValue triplets. The 
system encapsulates and defines similarity between profiles by use of a scoring 
mechanism. The scoring mechanism is generic in nature, and can apply to any 
attribute of a profile that has been defined in the system. It is important to note that 
5 different types of similarity can be established by the system. This allows different 
search goals to be executed. For example, a user may wish to know what projected 
salary they may expect to make in the marketplace, in their particular occupation, with 
the all other attributes, such as experience, location and industry, being similar. This is 
a common scenario for individuals wishing to switch jobs. The invention supports this 
10 type of analysis using a pre-defined search goal document, which specifies that the 
user's profile is. to be compared rta'a set <bf .prpfifes;tha^ iri ihe 'sanjfei 

area; industry;; etc.^ 
jdbs^r:which#ey^^ 
jcase£,differen^ 

15 

Reference is nowVm 
fehcajpisulateba^ 
relatjpnalistrilicture, : usm^ 

0xacf matches are considered,; as Th^ ^affinity majchbs^j 

20 although scored jov^ »^j^;tfi>^j|^m,1^u#.J^^ 
silow^fofawide^ 

ti&fflt^vfojRfr 'sol ves i; a;criticaf prpbierh; in thi^area of invention, ilei "preyiqu&i 
incarnations of occupational dbmparison systems require exact matches, and hence - 
lose much of their. value, bebause a large percentage of real world profiles that are-i 

25 similar do not usually have values with exact matches, or the number of true variations / 
is much higher than is usually accounted for in fixed reporting systems. For instance^ 
an IRS auditor may wish to be compared to an accountant, but because they have i 
different job titles, they "may never be compared in other systems, in itfiisi 
implementation, affinities (as described elsewhere) "are constantly being added and] 

30 updated through- manual and^^^a mese. vajueil 

together, arid hence, return similar profiles.; 



When a profile search . is, requested, the search goal definition document for the search ! 
is ^iev^^f'-s^l^&sfMeB'fDr how .profile attributes are to be matched;: and:: 
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compared: Jhie "user's profile, is then compared fo y aj| : other profiles; in the system, iand; 
each matching Value from the profiles is 'assigned^a score determined by .the fteldj; 
which it matches on. For instance, exact matehes pn the Fje 
a certain" relatively : high score. An affinity match ^ey^AirPebpJe with^ccpunfeg 1 Jpbs,' i . 
is also considered' because thm^:nmtchf8;^re similar but not, exactly, the same; ! 
iBecause pfthis, these matcheslare given a sljghtly lower, score. Each profile is then \ 
scored based- on all of is totaled ^p^i 

summing up the. individual scores. ; One .skilled; .in the art can understand that May J 
different types search gpa^ 
type ^ FieldGrqu^ The FieldGraup:^ 
m£tehscpres^ 

^ scores foi§ob;:tb create very, accurate rankings, of prpfil|sV returnedijn Jh^se^rch; i 
to ^et a .given search : g^ i1jp be returned for a search (qualify), a profile must meet 
a certain overall threshold score, which is predefined by an administrator as part of the 
search goal document. Additionally, the profile must contain matches on specific 
deterministic fields. The threshold score removes any profiles that do not match on 
enough attributes to be considered a high quality match. For instance, it is possible 
that a profile matches on many non-deterministic values, such as gender, geographic 
location, or experience level, but does not match on something that is critical (or 
deterministic), such as job type or specialty. An accountant may live next to an 
attorney, in the same age range, in the same community, who went to the same 
college, but they should not be considered matches because they are in different jobs 
and different job affinities. Therefore, in one type of search goal, only certain fields and 
affinities, such as those related to the job or specialty, are considered deterministic 
enough to be considered for matches. Again, one skilled in the art will understand that 
while Job and Skill are considered highly deterministic for basic compensation 
analysis, the system could be employed for other types of analysis where possibly 
other FieldGroups, such as Location or Gender, might be specified as highly 
deterministic instead. This allows for the definition and automation of many kinds of 
reports meeting many different profile search goals. - - 



Additionally, some combinations of attribute matches may be considered more 
valuable than other combinations. For instance, it may be preferable to match a 
smaller subset of profiles for which a location is closer to the target profile. By 
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employing matchgroups within the profile search goal document, sets of matches can 
also be ranked. This is useful, for instance, in ranking profiles higher where geographic 
proximity is desirable. By creating a set of affinities (described elsewhere) that group 
together profiles by regions, the profile search system using matchgroups targets 
5 profiles within local regions first, ranking them higher than profiles that meet other 
match criteria but are in outlying regions. For instance, the search goal may first only 
find profiles in the same or surrounding cities. By defining metropolitan region affinity 
groups, one can find people in the same metropolitan region. However, there may not 
be enough valid data in the system to find profiles in the same metropolitan region, 

10 and therefore, one must also consider profiles from a larger surrounding region, such 
as a state, or multi-state area, and choose them if necessary. By including affinities in 
a ranked fashion in matchgroups, it is possible to return profiles that are closest to our 
own by having them rank the highest. One skilled in the art would recognize that 
closeness could apply to any attribute of a profile in addition to location, such as 

15 experience level or age ranges. 

Reference is now made to Fig. 14 where another aspect of the invention, a set of 
profiles (usually but not necessarily retrieved via the profile search subsystem 
described elsewhere) is aggregated and their individual attributes are automatically 
summarized into calculations, such as averages, medians, standard deviations, and 
counts. The calculations are aggregated for fast real-time retrieval. Aggregate 
definitions define dimensions (such as skill) and measures (such as salary) into an 
aggregate (such as average salary by skill). Additionally, the system can summarize 
values into predetermined ranges, such as salary ranges or age ranges. A report chart 
format defines how charts are displayed to an end. usir/l^an^dlflereni^And^; such ! 
as HTML, PDF or JPEG, exist, and the report output format rhay : b&adj^tedja>«§&'! 
wlth;«riy;of these : fo^sits^nd;.trii^. > chart8' may be displayed as bar charts,: pie charts, :; 

30 An administrator defines an aggregate ^definition, for example 'i^^^Xf^lbrtnaii '; 
consisting: of measure and a. dimension and a name. Each aggregate is calculated ) 
over an affinity group or groups: For example, after a profile search; is executed, the list < 
of resultant profiles, is cornbined into ah "affinity group, such as u Pe6|ile5M^tihg Search. : 
Goal X for User Y," and" all .availabje report definitions ai^;ei»c]jjtei^;vi^^^^l| : 
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numerous aggregations. Each aggregate definition*- must contain a measure or , si 
dimension, or. both. '4f hq; measure is specified, the aggregate is calculated as a couhti 
oyer the entire dimension. If no dimension is specified; the average, 25th, 50th, 75th 
rhedian, standard, deviation, and „^ncl.axd,j|jTOr J /e&;r.-^ calculated ;.6wr'f£fo ; 9ji&j&i 
5 population using standard algorithms. Aggregisite^for^ahy 7 ' affinity group can .be! 
Calculated using this methbo\ Using this method allows fast retrieval of aggregate ! 
information, and easy /definition ^of new aggr^afipns^^'lj'i^ are. avaNab|e to. any : of ; 
thousands of affinity groups defined in the System:; 

10 Because hundreds of aggregate. defini over thousands of affinity - 

groups, it is impossible to present all qf this data tb-abumeh in, a fdrmat which they ; 
could easily and q uickly r understand 'because^ tiie , resulting" data.;, w 
thousands of pages, To ;s;bivefWiis prqblem^4 inyejitibti ;grqujpi ; aggre^atiqns¥intp;j 
charts a^°'-igVdui^ian^'ln1l^^ aWd arrarigesvthe charts? m • aMa^utlwhiqhf i§1 

1 5 understandable andiiuseful '^fMf^Mi^^^ : ^M'^> '"Hi&^n&iigl^a^^c^'^ 
icharts that ar4 determined to "be appH^ 
This, allows for ^^^^ 

any time, without h^ to predefine a : ! report for any particular group. For instancli- a '■■ 
group named "Pay Ihfluencers'r m|ghti cp ."^yerage Salary by ; 

20 Jpbf . "Average Salary By?:Practice Area," vSv^rage Salary By iTeachirig ;MrM1 
"Average Salary By Skill^^Ayetegp S|!^iy; R^e b$ 

Bar Association etc; ;But n the charts In this group are displayed, depending i 

bn the applicability to fhe : user's profile..; For example, the! chart "Average S^fary; By ; 
Teaching Rank*' is hot displayed if the report is for a lawyer's profile.; Another grdup^ 

25 named "Geographic Outlook" might contain charts; such as "Average Salary By Cityi'i 
;br "Average Salary By State;*' 

Charts are grouped into logical sections that are recognizable to a human, and the ' 
system renders the charts in a grouped layout, so as to be -in ..an understandable, : 
30 cohesiveT readable presentation. For a particular set of similar profiles- there are many? 
charts, that are defined according to a report specification, * A ^ :seriqs not charts; are i 
combined into a group which -is then parsed and formatted by a program into a User! 
readable format, such as HTML, 
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FPr ihstande, a lawyer's profile may have .matched to many other attorneys, air Who ! 
have answered a FieldGroup for Bar AssoGiatipn and ^:Reld^roup:forKo'u#i|illing^ 
Rate. The^cpmpiled report for the lawyer can autpmatically show a chart for "Average ■ 
Billing Rate by Bar Association," whereas a report for a High School Teachers wouid i 
5 !not; show this chart .becau>e;^p6- ! m.atcniftg profiles have either answered the "Bar . 
Association" FieldGroup or; the "Hourly Billing; Rate" FieldGroup: This is r an important • 
innovation over previous aRproadhes, . Which. returning generic.; 
assessments; that, g^ 
subtler,: yet far more useful 

10 

R^fecence is nbW madeto Fig, : 1i8^h^ engine js shown. A v r 

engine is set up to process p^^ 

P^bfems; and faulty d^ system dfefined; by an^expPtt! 

on the domain. Because the: 

1 5 user jnput, -a.s well a^ freely input nUmencrdata, rulesJare'put in.place to autpmaticaNy i 
moriitbr th^t daita for validity. This helps to automate the prbcessof dafe 
allbWs, an administratonto review laiie nUrribe profiles more ^efficierrtly; ;F:pfy 

instance; a iiser may enter data that is obyipusly bpgusi/such a^ ] 
tod high, or-too low for tl^ curhency ^type and cpur^sucii* as '0.' Also,, in spniei cases! 

20 the rutes engine 

The rules engine. is; implementedrusing. asef oY'databajs^ 
and stored procedures that c^ 

take some action as a result. Each of these queries is run agaihst ; each new?profiie j 
25 input into the system by users, : ahd -fhe resultsrof the rules engine queries, including a : 
list of changes determined by the rules, are stored in a table, The rules engine allpws j 
stored procedures tp be iset up Which can check for any type of data error: i 

Another example of a rule is if a user enters too many values for a particular ! 
30 FieldGroup; suclv¥s"Sk7ii; If a use> r has answered'tbormany^Tfe- chances are'tRat- 
some of those, skills are, not valjd' or the user is just playing with the system,; arid 
hence, the profile js invalidated, and not considered for profile searches and reports/ A ; 
flag on the profile marks a profilers Active pr inactive. Inactive profiles are not used 
any other part of the Syste;m 'for calculations, including compjarisons in^^he.sUivey ; 
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engine, search goals,; or fepqt&jr'^fc^y^dstaZ&e' cleaned and maintained- as ; 
statistically valid. Many other flags -'are r also ; set : on a profile, and one skilled! in the art 
can easily recognize that these flags are useful ways 'tp^gauge profiles'; use in various-: 
aspects of the system. Rules in this form can also be used to modify existing profiles i 
for data entry mistakes.. For instance,; a user may commonly enter a new! open-text] 
value for one FiejdGroup that:i 

yaiid: as ppssjbl^irt^niop^n ^stenj^uefras this, the rujeV engine_moyes the answer;! 
from one FieidGroup. to • aripth'er. pne can a 

extended ,to any situation that applied profiles.: 
Referringrnpw to F 

(hereinafter^ I 
plurality^ of ^ ^ft.'j^l|riH^1^x^^e' ; cljenp£$ 

i1 , the: -purpose tdf] 

establishing "a" personal, ranking against, other users who^ , 
apparatus f1t> ; pr otherf apparatus . user pf;a cp.m.pensatipn purvey; 

implementation pi^h appafatd^ 

wish . to; establish :a"iro§RectJye -salary aCa ;cef fein- geographic; area: fin another' 

embodimeritpf^he lnyenTtibM ^^M'^M4^&^i1^li rpay baJntelesmd- 
in, reviving ;repprts?pr^ 

previously answered; jhe .sriryeik For e^mplef jn"t^ 

i^plementatipn^bf the apftari^ wish to r^ceiyea report of the j 

average salary of !a specific jPb^hctipn^ or a report of the average salary expectatiph j 



The Apparatus 110 is connected to the users 120, for example client 120-1, by means 
of a network 130. The network 130 may be, but is not limited to, a local area network 
(LAN), a wide area network (WAN), or the world-wide-web (WWW). Through the use of 
Web pages and standard communication protocols, a client 120 may easily access the 
apparatus 110 for both surveying and reporting purposes. In one embodiment, the 
apparatus 110 comprises a central processing unit (CPU) 112, a work memory 114, a 
code memory 116, and a database storage 118. The CPU 112 executes computer 
code stored, for example, in the code memory 116, and which enables among others 
the handling of the communication protocols between the apparatus 110 and the 
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clients 120, the display of Web pages, search of the content of the database, 
preparation of reports, and the like. Work memory 114 is used by the CPU 112 for its 
operation. The database 118 retains user profiles, a plurality of questions and 
corresponding answers, report definitions, affinity groups, periodic reports, and the 
5 like. 



Reference is now made again to Fig. 1, where the details of a Website implementation 
of the system 100 incorporating the apparatus 110 are further discussed. There are 
three basic building blocks to the system: display components 210, business 

10 components 240, and data components 260. The display components 210 are handled 
by the client 120 and the apparatus (or server) 110. The client 120 provides the display 
capabilities by using a Web browser 220. The apparatus 110 provides the display 
capabilities through the use of the hyper-text transfer protocol (HTTP) 230. Display 
components 210 enable the visual display of a survey and reports as may be required 

15 by a client 120. The system 100 further comprises business components 240 which 
have core components 245 and support components 250. Core components 245 
comprise at least a survey engine 247, a search engine 248, and a reporting engine 
246. Support components 250 may include, but are not limited to, commerce 251, 
security 252, administration 253, and content management 254. Business components 

20 240 enable the system 100 to provide the survey and reporting services in accordance 
with the disclosed invention. The data components 260 are further comprised of a 
structured query language (SQL) server database 261, a full text catalog 262, and one 
or more extensible markup language (XML) configuration documents 263. These 
components provide the data as may be required for the proper operation of the 

25 system 100 and particularly the apparatus 110, in accordance with the disclosed 
invention. 

Returning to Fig. 20 it should be noted that other than Web browser 220 on client 120, 
all the other components described in Fig. 2 may reside on the apparatus 110. 
30 However, a person skilled in the art would be able to adapt the disclosed architecture" 
such that certain modification are made which provide for specific system 100 needs. 
For example, the database 118 may be distributed and accessible through the 
network 130. In another embodiment of the disclosed invention, a plurality of 
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apparatus 110 handle the surveys and have access to a plurality of respective 
databases 118. 



The code memory 116 contains* in addition to other codes, a code that is targeted for 
5 the matching process required for the purpose of matching a user to a plurality of other 
users and establishing the user's unique affinity group. In contrast to prior art solutions, 
the invention disclosed herein provides for a dynamic creation of affinity groups, in 
addition to the comparison to previously stored affinity groups. The advantage of 
automatically detecting and creating affinity groups is in the fact the multiple 

10 parameters filled out by the user as part of the survey allow for increasingly accurately 
assessing the affinity group or groups with which a user should be associated with. A 
user of the system is therefore able to match his profile to that of others such as 
himself, and determine, for example, an expected salary for a certain job description or 
geographic area. However, the method is not limited for job related surveys and these 

15 are provided merely as an exemplary and no-limiting usage scenarios. A person skilled 
in the art would further note that the methods disclosed herein below may not 
necessarily require the exemplary apparatus disclosed above, and such methods may 
be applied to user profiles and affinity groups provided from other means, automated, 
manual, or combinations thereof. 

20 

In accordance with the disclosed invention, it is necessary to perform a series of 
matches between user profiles, for the purpose of providing customized reports. In one 
embodiment of the disclosed invention, a profile entered by a user is matched against 
a plurality of user profiles stored in the system. More specifically, the entered profile is 

25 matched to other profiles according to "matchgroups" (explained in more detail below) 
to determine if the profiles are similar, or like, each other. It is therefore possible to 
create a list of user profiles that are like the profile entered by the user, otherwise 
known as an affinity group. Using matchgroups, it is further possible to match a 
plurality of profiles against a goal, where a goal may be a set of parameters to which a 

30 subset of the plurality of user profiles matches best according to a defined score arid" 
score threshold. It is further possible to match an entered profile to one or more affinity 
groups having known characteristics. In one embodiment of the invention such 
matching is performed at every stage of the answers provided by the user as the 
survey questions are presented and responded to. In another embodiment, such 
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matching is applied after a completed survey, taking into account all attributes of the 
entered user profile, and producing a resulting group or groups of matching profiles 
according to the overall matching goal. 

In accordance with the disclosed invention a matchgroup may be defined as a plurality 
of match fields and a plurality of affinity groups, each (match field, affinity group, and 
matchgroup) having a score value. If the entered profile has values matching other 
profiles' values according to the matchgroup, the matchgroup score is increased by the 
score value defined for the matching field. If the entered profile has values matching 
within an affinity group, the matchgroup score is increased by the affinity score value. 
The plurality of scores is tallied, allowing the determination of an overall score for the 
matchgroup. Furthermore, the process is repeated for any number of defined 
matchgroups having potentially different match fields and affinity groups, allowing the 
determination of an overall score for a defined goal. Matching profiles may be then 
included or excluded depending on threshold score values defined for this purpose, 
such as "Excellent Match," "Good Match," or "Bad Match." The process of matching is 
hereinafter described in greater detail. 

Reference is now made to Fig. 21, where an exemplary and non-limiting flowchart 300 
20 for the purpose of creating a user specific affinity group is shown. In step S310, a user 
profile is received (hereinafter the "received user profile"). This may be a user profile 
that was prepared based on an automated survey system, discussed in more detail 
above. In step S320 a score is associated with each of the possible match groups as 
shown above. A match group has a plurality of fields, affinity groups, and respective 
25 weights or scores, which allows for combinations of attribute matches, may be 
considered more valuable than other combinations. For instance, it may be preferable 
to match a smaller subset of profiles for which a location is closer to the target profile. 
By employing matchgroups within the profile search goal document, sets of matches 
can also be ranked. This is useful, for instance, in ranking profiles higher where 
30 geographic proximity" is desirable. By creating a set of affinities designed to relate" 
profiles by regions, the profile search system using matchgroups targets profiles within 
local regions first, ranking them higher than profiles that meet other match criteria but 
are in outlying regions. For instance, the search goal may first only find profiles in the 
same or surrounding cities. By defining metropolitan region affinity groups, user 
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profiles of people in the same metropolitan region oan be found. However, there may 
not be enough valid data in the system to find user profiles in the same metropolitan 
region, and therefore, user profiles from a larger surrounding region, suoh as a state 
or multi-state area, oan -be considered. By including affinities in a ranked fashion in 
matchgroups, user profiles that are closes" to compared to user profile may be located 
such located user profiles having the highest score ranks. One skilled in the art would 
recograze that closeness could apply to any attribute of a profile in addition to location 
such as experience level, job category, company type, age ranges, and others. In step 
S330, a search is performed matching a plurality of user profiles to that of the received 
user profile. As a result of the search, a list of scores is returned for each of the 
compared user profiles. In one embodiment of the invention, the list is returned 
ordered from the highest to the lowest score. A more detailed description of the search 
process is provided below. In step S340, any number of score fitness ranges is 
assigned, for example, a score range defining an excellent fit, a score range defining a 
good fit, and a score range defining below acceptable fit. More specifically, if the score 
is from '0' to '100' then excellent fit may be scores of '90' through '100/ good fit having 
scores of 70' through '90,' and so on. In step S350, a report of the profiles best fitting 
the received user profile is provided, identifying those profiles that best fit as a new 
affinity group. In another embodiment of the invention, the created affinity group is 
checked against the already existing affinity groups and, if there is a match, no new 
affiraty group is created. An advantage of the disclosed invention over prior art is in the 
creation of an affinity group, which includes at least the received user profile This 
allows for a larger match scope than the static assignment of an affinity group 
commonly found in prior art solutions. A person skilled in the art may modify the 
disclosed flowchart to flt with the needs of specific programming languages, e.g. C ++ 
or description languages, e.g. XML, without departing from the scope of the disclosed 
invention. 

Referring to Fig. 22, an exemplary and non-limiting flowchart of step S330 of Fig 3 for 
the profile search process is shown: The flow chart explains the search and matching 
process of step S330 discussed in Fig. 3 above. In step S330-10 a user profile to be 
matched to the user profile received in step S310 is selected (hereinafter the "selected 
user profile"). The selected user profile is one that was not previously selected during 
the specific execution of the current search and matchgroup. ,„ step S330-20 the 
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match group score of the selected user profile is determined in comparison with the 
received user profile. The match score may be a binary value, for example, '0' 
meaning no match and '1' meaning match. It is also possible to have a sliding degree 
of fitness with multiple fitness score values, for example, '0' = no match, T = bad 
5 match, '2' = acceptable match, '3' = good match, and '4' = excellent match. The 
specific match score is correlated further with the score value that is indicative of the 
weight a certain match field may have. As noted above, certain match fields may have 
a higher importance for the overall score than others, and hence have a higher score 
value, or effect, on the overall score result. It is further possible to limit the score, for 

10 example to a maximum score, for any matchgroup, so that a multitude of matches of a 
particular field do not skew the overall match score. In step S330-30, it is checked if 
more match fields within the same user profile are to be checked and if so execution 
continues with step S330-20. Otherwise, execution continues with step S330-40. In 
step S330-40, an overall fitness score respective of the user profile in comparison to 

15 the user profile received in step S210 is calculated. This is a weighted value that takes 
into consideration all the relevant matchgroups. The overall result is used as a score 
indicative of the level of fitness between the two user profiles. In one embodiment of 
this invention these scores are normalized, for example, '100%' meaning a best fit and 
'0%' being no fit. 

20 

A matchgroup may be classified as being deterministic or non-deterministic, and 
requiring at least one deterministic match to exist within the overall score, thus 
allowing a more deterministic match than solely a weighted fuzzy type of result. For 
example, a matchgroup designated for 'Job Type' could be marked as deterministic, 

25 and then no profile is included in the overall affinity group unless there is at least a 
score, which may also require to be at least above a predefined threshold value, given 
to the Job Type matchgroup for the profile. Consequently, any score in a non- 
deterministic matchgroup could serve to increase the overall score of a profile, but in 
this case cannot by itself be sufficient to allow the profile to be included in the final fit. 

30 One skilled in the art would recognize that parameters such as this can be considered 
optional in different uses of the system, and are not meant to be limiting, but providing 
further flexibility. 



10 
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ln yet another embodiment of the disclosed invention it is further possible to score on 
ranges of values rather than by an exact match. For example, a matchgroup can 
define a match field for Age Range, having defined ranges of Less than 17, 18-29, 30- 
45, 45-64, and 65+. If the entered profile matches by range with any user profile to be 
compared, the user profile could receive a score for such a matchgroup. It is also 
possible to score based on other Boolean criteria, such as a match field existing in a 
profile and having any value at all, as opposed to not existing in a profile. It is further 
possible to limit matches by a fixed criteria, such as only to allow for matches with 
profiles constrained to a particular arbitrary date range. 



In step S330-50, it is checked whether there are more user profiles not yet matched 
against and if so execution continues with step S330-10. Otherwise, execution 
continues with step S330-60. In step S330-60, a report including the score results of 
the user profiles to which the user profile received in step S310 was matched against 

15 are provided. These results, for example, may be used for the purpose of reporting by 
the apparatus 110 discussed in more detail above. The survey apparatus 110 uses 
this list to determine the affinity group to which a user belongs andto prepare tailored 
reports based, at least in part, on such an association to an affinity group or the 
determination of affinity groups as discussed in more detail above. A person skilled in 

20 the art may modify the disclosed flowchart to fit with the needs of specific programming 
languages, e.g. C++, or description languages, e.g. XML, without departing from the 
scope of the disclosed invention, or further be able to use it on an apparatus other than 
the one described as the apparatus 110. 

25 Reference is now made to Fig. 23 where an exemplary and non-limiting extensible 
markup language (XML) code for profile matching is shown. The code may reside in 
memory code 116 of the apparatus 110. Lines 504 through 507 provide information of 
filters to be applied on the search data. This may include filters such as the state or a 
country in which a user is located or in which the user wishes to be. Another filter may 

30 be the profiles to be searched which can include all the profiles available, or a subset 
thereof. Lines 508 through 514 define sources for user profiles. Such sources may be 
available on database 118. Such sources may be affinity groups created by apparatus 
110. Lines 516-522, 523-527, 528-532, 533-537, 538-541, 542-548, 549-552, 553-557, 
558-561, 562-564, 565-567, 568-577, 578-581, 582-585, 586-590, and 591-594 are 
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examples of matches performed in accordance with the disclosed invention, as made 
applicable to an XML implementation. Lines 595-598 define the ranking ranges so as 
to allow for the grouping of the matching profiles in accordance with the disclosed 
invention. A person skilled in the art may modify the disclosed code to adapt to other 
5 types of searches for the purpose of matching user profiles without departing from the 
scope of the disclosed invention. 



Referring to Fig. 24 an exemplary and non-limiting flowchart 600 for the matching of a 
user profile to an affinity group, is shown. In step S610, an affinity group to be matched 

10 to a user profile, for example the user profile received in step S310 shown in Fig. 3, is 
selected. This affinity group is one that was not matched before during the specific 
execution of the current search and match. The match process entails the 
determination of scores of match groups of the affinity group with respect to the 
received profile. In step S620, a match group of the affinity group and the received 

15 user profile are compared to determine a score. The score may be a binary value, for 
example, '0' meaning low and T meaning high. It is also possible to have a sliding 
degree of fitness with multiple fitness score values, for example, '0' = no match, T = 
bad match, '2' = acceptable match, '3* = good match, and '4' = excellent match. The 
specific score may be further weighted based on the certain match group used. For 

20 example, the match group of location may have a higher weight than that of a job 
position. In step S630, it is checked if more match groups between the affinity group 
and the received user profile are to be checked and if so execution continues with step 
S620. Otherwise, execution continues with step S640. In step S640, an overall fitness 
score respective of the affinity group in comparison to the user profile is calculated. 

25 This is a weighted value that takes into consideration all the match fields compared 
and their respective score values. The overall result is used as a score, indicative of 
the level of fitness between the affinity group and the user profile. In one embodiment 
of this invention these scores are normalized, for example, '100%' meaning a best fit 
and '0%' being no fit. In step S650, it is checked whether there are more affinity groups 

30 not yet matched against and if" so execution continues with step S610. Otherwise, 
execution continues with step S660. In step S660, a report including the score results 
of the affinity groups to which the user profile was matched against are provided. A 
person skilled in the art would be able to extend the apparatus 110 to provide the 
necessary reports by, for example, including the required code in code memory 116. A 
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person skilled in the art may further modify the disclosed flowchart to fit with the needs 
of specific programming languages, e.g. C++, or description languages, e.g. XML, 
without departing from the scope of the disclosed invention. 

Reference is now made to Fig. 25, where an exemplary and non-limiting flowchart 700 
for reporting of results of a matching process is shown. Such a report may be provided 
in response to a goal setting as described above, and in more detail in a U.S. patent 
application entitled AUTOMATED COMPENSATION REPORTS USING ONLINE 
SURVEYS AND COLLABORATIVE FILTERING, Serial Number (unknown), filed on 
the same day and date, assigned to common assignee, and which is hereby 
incorporated in its entirety by this reference thereto. In step S710, a user is asked if an 
extended report is desired and, if so, execution continues with step S730. Otherwise, 
execution continues with step S720, where a standard report is generated and! 
thereafter, execution continues with step S750. The report is generated based on the 
user profile and the respective affinity group associated with that user profile 
generated by the matching system. In step S730, billing information is received from 
the user, for example client 120-1. Such information may include, but is not limited to, 
a purchase order (PO) number, a credit card number, or a debit card number. In step 
S740, an extended report is generated, the report being based on the user profile and 
the respective affinity group associated to that user profile. An extended report may 
provide additional information otherwise not made available to the non-paying user. 
Specifically, such extended reports may include reports of a statistical nature 
respective of a plurality of users. This is the case where a human resource manager 
wants to find out certain statistics about compensation demands common in a certain 
geographical area. In step S750, the extended report or the standard report, as the 
case may be, is sent to the user, for example by means of displaying it as a Web page 
on the terminal of the client 120, as an e-mail attachment, and the like. In another 
embodiment of the invention the client 120 may further provides report parameters that 
influence the desired output of the report, for example, the type of information to be 
displayed, the manner in which such information Is provided, and the like. Such - 
information may be provided by means of defining goals as described in greater details 
in the above mentioned U.S. patent application. 
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Although the invention is described herein with reference to the preferred embodiment, 
one skilled in the art will readily appreciate that other applications may be substituted 
for those set forth herein without departing from the spirit and scope of the present 
invention. Accordingly, the invention should only be limited by the Claims included 
below. 



